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Abstract 

We propose an analysis of corrections which models some of the require- 
ments corrections place on context. We then show that this analysis nat- 
urally extends to the interaction of corrections with pronominal anaphora 
on the one hand, and (in)definiteness on the other. The analysis builds 
on previous unification-based approaches to NL semantics and relies on 
Higher-Order Unification with Equivalences, a form of unification which 
takes into account not only syntactic /3r;-identity but also denotational 
equivalence. 

Wir schlagen eine Analyse vor, die einige der Anforderungen von Korrek- 
turen an den Kontext modelliert und sich natiirlich auf die Interaktion 
von Korrekturen mit Pronominalanaphern und Undefiniertheit erweitern 
lafit. Die Analyse basiert auf bekannten unifikationsbasierten Ansatzen 
fiir die Semantik natiirlicher Sprache und benutzt eine Erweiterung der 
Unifikation hoherer Stufe. Diese beriicksichtigt nicht nur strukturelle firj- 
Gleichheit, sondern auch logische Aquivalenz. 



1 Introduction 

Corrections are utterances such as (^d) where a discourse participant corrects 
the utterance of some other discourse participant]^ 

(1) a. A: Jon likes Mary. 

b. B: No, PETER likes Mary. 



Although there is much hterature on corrections (e.g. [3JS77, Nor91, EIB82| ), a 



thorough investigation of their hnguistics is still outstanding. In this paper, we 



build up on [vL94] and examine some of the requirements corrections place on 
context or in other words, the relationship between correction (the correcting 
utterance) and correctee (the utterance being corrected). For instance, it is 
clear that the pair of utterances in (^ does not form a well-formed dialog. 

(2) a. A: Jon likes Mary. 



^Here and in what follows, we use capital letters to indicate prosodic prominence. 



b. * B: No, PETER likes Sarah. 



On the other hand, it is also clear that a simple equality requirement between 
the semantic representation of the deaccented part of the correction and that of 
its parallel counterpart in the source is not appropriate either: 

(3) a. A: Jon likes [the woman with the red hatji 
b. B: No, PETER likes Sarahi 



Here the correction contains an NP Sarah whose semantic representation is not 
identical with that of its source parallel element the woman with the red hat. 



In other words, a requirement such as |Sag76|'s alphabetical variant constraint 
would fail0. At this stage one could be tempted to conclude that the equality 
requirement is a semantic one: the deaccented part of the correction must be 
semantically equivalent with its parallel correlate in the source utterance. How- 
ever, this is also incorrect. Thus in the property denoted by the VP in 
(^) need not be the same as the property denoted by its parallel counterpart 
in (^): whereas the VP in (|^a) denotes the property of loving Jon's wife, the 
VP in (|b) may denote the property of loving Peter's wif^. 

(4) a. A: Joni loves hisi wife. 

b. B: No, PETER loves Ms wife. 



In short, it is clear that some identity requirement is needed to appropriately 
characterise the relation between correctum and correction (cf. example]^). On 
the other hand, it is less clear what this identity requirement should be (cf. 
examples In this paper, we contend that the correct notion of identity 

is given by Higher-Order Unification with equivalences, a form of Unification 
which takes into account not only syntactic identity, but also denotational equiv- 
alence. We show that the HOUE-based analysis of corrections we propose, not 
only captures some of the contextual requirements of corrections, but also makes 
appropriate predictions about the interaction of corrections with both pronom- 
inal anaphora and (in)definiteness. 



2 HOU with Equivalences 



Now we will briefly review higher-order unification and its properties, for de- 
tails we refer the reader to |Sny91|. Higher-order unification solves the problem 

■^Sag proposes an analysis of VP ellipsis which requires that the semantic representation of 
a VP ellipsis be an alphabetical variant of the semantic representation of its antecedent. The 
basic assumption is that semantic representations are A-terms. Two terms are alphabetical 
variants of each other iff they are identical up to renaming of bound variables. 

''This is of course similar to the sloppy/strict ambiguity charact eristic of VP ellipsis. In- 



deed, as we shall later see, our treatment is very similar to DSP91 



's treatment of VPE. 



2 



of finding substitutions cr that for a given equation A = B make both sides 
equal in the theory of /3?7-equahty {(t{A) —p^ a{B)). Huet's weU-known al- 



gorithm |Hue75] solves the problem by recursively decomposing formulae and 
binding Function variables to most general formulae of a given type and given 
head. 

However, even though HOU considers /3ry-equality of formulae, it does not take 
into account the semantics of the logical connectives and quantifiers contained in 
the logical representation of natural language utterances. For this we need a uni- 
fication algorithm for /Jry-equality augmented by logical equivalence. Obviously, 
such an algorithm has to generalize theorem proving methods for higher-order 
logic, since the task of unifying an equation [A V ^A) = T, where T is a sen- 
tence, is equivalent to proving the validi ty of th e theorem T^. An algorithm 
that solves this problem is describ ed in |Koh95 |. It is a generalization of the 



first-order Tableau method | Fit9C | for automated theorem proving, which re- 
futes a negated theorem by analyzing the connectives in an and/or tree and 
finding instantiations that close each branch of the tree by finding elementary 
contradictions on it. 

Instead of a formal recapitulation of the tableau method, we discuss the example 
of the logical theorem (p(a)Vp(&) => 3x.p{x)). The negation of this is equivalent^ 
to the formula at the root of the following tableau. 

p{a) V p{h) A \fx.-ip{x) 
p{a)yp{b) 
\/x.-ip{x) 



p{a) 
*[y = a 



p{b) 
-^p{z) 



Here we see that conjuncts are simply added to the branch, whereas disjuncts are 
analyzed in separate branches of the tree. The scopes of universal quantifications 
(with new variables) can be inserted at the end of branches, the same is possible 
with the scopes of existential quantifications (with the bound variables replaced 
by Skolem]^ terms) . Finally, both branches of the tableau are closed, i.e. the 
last formula can be instantiated (by the substitution in brackets) so that it 
contradicts a formula in the branch above. 

These instantiations are computed by unification, and in the case of higher- 



order logic by HOU. The distinguishing feature of the HOUE algorithm [Koh95| 
is that intermediate equations {A — B) of type t (generated either by unifying 
two formulae on the branch to make them contradictory or by processing other 
unification problems) can be transformed into negated equivalences (which can 

*The formula (A V ^A) must be true in all models, so T can only be equivalent to it, if it 
is a theorem. 

^In addition to the de Morgan laws we use the identity 3x.A = ^Vx.^A. 

''Skolem terms serve as witnesses for the objects whose existence is claimed by the exis- 
tential formula A. Since this object may depend on the values of free variables x\, . . . ,Xn 
occurring in A, they have the form f(x\, . . . ,x„) where / is a new function. 
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then be treated by the theorem proving component). Actually, tableau devel- 
opment for the negated equivalence -^{A <^ B) contains trivial branches, so we 
use the following (optimized) rule, which splits an equation of type t into two 
tableau branches 

A = B 



A 



B 
-nA 



This way, HOU and tableau theorem proving recursively call each other in 
HOUE, until a refutation is found (all branches of the tableau are closed). 



3 The basic analysis 

Typically, a correction partially or completely repeats a previous utterance and 
one of its characteristic properties is that the repeated material is deaccented, 
that is, it is characterised by an important reduction in pitch, amplitude and 



duration (cf. |Bar95||). Our proposal is to analyse corrections as involving a 



deaccented anaphor which consists of the repeated material. Furthermore, we 
require that the semantic representation of a deaccented anaphor unify with the 
semantic representation of its antecedent. 

More precisely, let SSem and TSem be the semantic representations of the 
source (i.e. antecedent) and target (i.e. anaphoric) clause respectively, and 
TP^ . . . TP'\ SP^ . . . SP"^ be the target and source parallel elementsj], then the 
interpretation of an SOE must respect the following equations: 

An(5Pi,...,5'P") = 55em 
An(rF\...,rP") ^TSem 

Intuitively, these two equations require that target and source clause share a 
common semantics: An, the semantics of the deaccented anaphor. We illustrate 
the workings of the analysis by a simple example. Given the dialog in (|^), the 
equations to be solved are: 

An{j) = like{j, m) 
An{p) = like{p, m) 

Given these equations, HOU yields a unique solution An = Xx.like{x,m). In 
contrast, the equations required for the analysis of example (^ are: 

An{j) — like{j, m) 
An{p) = like{p, s) 



^As in [DSP91|, we take the identification of parallel elements as given. 
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Since there is no substitution of values for free variables which simultane- 
ously makes An{j) a/Jjy-identical with like{j, m) and An(p) a/^Ty-identical with 
like{p, s), unification fails thereby indicating the ill-formedness of (|^). 

4 Corrections and pronominal anaphora 

The resolution of pronouns occurring in the destressed part of a correction 
appears to be subject to very strong parallelism constraints. For instance in 
), the pronoun her can only be understood as referring to its source parallel 
element Sarah - else it must be stressed. 

(5) a. Jon loves Sarahi . 

b. No, PETER loves her. 

Intuitively, there is a simple explanation for this: if the destressed part of a cor- 
rection is a repeat of its parallel element in the source utterance, then pronouns 
occurring in it must necessarily resolve to their parallel counterpart in the source 
expression. As we shall see, the picture is somewhat more complex however. In 
some cases, a destressed pronoun in the correction may be ambiguous. In other 
cases, it functions as a paycheck pronoun. Finally, extraneous factors such as 
scope constraints and world knowledge interact with the semantics of correc- 
tions in determining the resolution of destressed pronouns. In what follows, we 
show how HOUE allows us to correctly predict this array of empirical facts. 

4.1 Pronouns 

Let us start with example (||) above. Given the analysis of corrections described 
in section ^ the equations to be resolved are^: 

An{j) = love{j,s) 
An{p) — love{p,x) 

By unification, the only possible values for An and x are Xy love{y, s) and s 
respectively. That is, the destressed pronoun is resolved by unification to its 
parallel element in the source utterance, Sarah. As required. 

In some cases however, a destressed pronoun in the correction is ambiguous. 
For instance in (^) , the pronoun his may resolve either to Jon or to Peter. 

(6) a. Joni loves hisi wife. 

^Unresolved pronouns are represented by free variables i.e. variables whose value is deter- 
mined by unification. Alternatively, pronouns could be resolved first and unification would 
then function as a filter on admissible resolutions. 
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b. No, PETER 2 loves hisi_2 wife. 



Interestingly, such cases are similar to the sloppy/strict ambiguity^ characteris- 
tic of VP ellipsis and as |DSP91| have shown, HOU straightforwardly captures 
such cases because of its ability to yield several solutions. In the case of (^, 
the analysis proceeds as follows. First, the following equations must be resolved: 



An{j) = love{j, wof {j)) 
An{p) = love{p, wof (x)) 



Resolution of the first equation yields two values for A/jp^ \y love{y, wof{j)) 
and \y love{y, wof{y)). By applying An to p, we then get two possible values 
for An(j)): love{p, wof{j)) and love{p, wof{p)). As a side effect, the pronoun 
his represented by x is resolved either to Jon or to Peter. In short, for such 
cases, the multiple solutions delivered by HOU match the ambiguity of natural 
language. 



4.2 Paycheck pronouns 

Destressed pronouns whose source parallel element is a pronominal possessive 
NP are particularly interesting. At first sight, they seem to behave just like 
any other destressed pronouns occurring in a correction, that is, they seem to 
resolve unambiguously to their parallel source element. For instance, in ^jo), 
the most likely resolution of her is Jon's wife. 



(7) a. Joni likes hisi wife. 

b. No, PETER likes her (= hisi wife) 



However, a closer investigation of the data suggests that this reading is a kind 
of default reading which is preferred out of a pair of two grammatically possible 
interpretations. To see this, consider examples (||) and @. 

(8) a. Joni broke hisi arm yesterday. 

h. No, PETER2 broke it (= hisi,2 arm) yesterday. 



^The terminology sloppy /strict originated with ]Ros67| ]. Intuitively, a pronoun has a strict 



interpretation if it denotes as its antecedent. By contrast, a pronoun which denotes differently 
from its antecedent is said to have a sloppy interpretation. 

^"Unification yields a third value for An, namely Xy love{j, wof{y)). This solution however 
ijS rilled out by the second equation. More generally, we assume a restriction similar to 



|DSP91 's Primary Occurrence Restriction (PGR): the occurrences directly associated 
with the contrastive elements are primary occurrences and any solution containing a primary 
occurrence is discarded as linguistically invalid. For instance, in An(j) = love{j, wof (j)), 
the first occurrence of j is a primary occurrence so that the solution An = \ v love d, wof(y)) 
is ruled out. For a proposal of how the PGR can be formally modelled, sec [ |GK9e| ]. 
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(9) a. Joni had hisi nose remodelled in Paris. 

b. No, PETER2 had it (~ his2 nose) remodelled in Paris. 



Although these examples are structurally identical with (Q), they differ in the 
interpretation of the destressed pronoun occurring in the correction. Whereas 
only allows for a strict interpretation of this pronoun, (|^) permits both a 
strict and a sloppy interpretation whilst (^) only admits of a sloppy reading. 

Our contention is that a destressed pronoun in the correction whose source 
parallel element is a possessive definite, is systematically ambiguous between a 
strict and a sloppy interpretation. However extraneous factors may have the 
effect that only one reading is available. For instance, in (j^) the strict reading 
is ruled out by our world knowledge that one can only have one's own nose 
remodelled. As for (^), the absence of sloppy reading can be explained if we 
assume that the interpretation of a destressed anaphor follows a default strategy 
geared toward maximal semantic identity between the destressed anaphor and 
its antecedent. Under this assumption, the strict reading is the most natural 
since it establishes a strict denotational identity between the antecedent VP 
likes Jon's wife and the destressed anaphor likes her. 

The behaviour of these pronouns is simply explained once they are viewed as 
paycheck pronouns as illustrated by Karttunen's famous example (cf. p^ar69 |): 



(10) The man who gave his paycheck to his wife was wiser than the man who 
gave it to his mistress 

Paycheck pronouns differ from other pronouns in that they can neither be seen 
as coreferential constants nor as bound variables - instead they pick up the 
definite description introduced by their antecedent and reanchor its possessive 
pronoun in its immediate context. For instance in (^0|) above, the paycheck 
pronoun it picks up the description his paycheck and reanchors its possessive 
pronoun his to the second occurrence of the man. 

There are various ways in which paycheck pronouns can be accounted for but 
essentially, the idea is that their denotation is fixed by a definite description 



containing either an unresolved pronoun or an unresolved property. As |Coo79| 
convincingly argues, the second solution is methodologically more satisfactory. 
We will therefore assume that paycheck pronouns are definite NPs whose rep- 
resentation includes a free variable of type (e — > t) i.e. a property. More 
specifically, we assume that a paycheck pronoun is assigned the following repre- 
sentation: 

XQ.3x[Pix) A yy[P{y) ^ y ^ x] A Q{x)] 

where P G wif(e^t)- Given this, the analysis of (0) runs as follows. The 
equations to be resolved to check the well-formedness of the destressed anaphor 
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likes her are:Q 

^'T-(j) = =la;[ wof(x,j) A unique(x) A love{j, x)] 
An{p) ~ 3x[P{x) A unique(x) A love{p, x)] 

Resolution of the first equation yields the two values Xy3x[ wof{x,j) A 
unique{x) A lovely, x)] and \y3x[ wof(x,y) A unique(x) A loveijj^x)] for An 
and thus, the values 3x\ wof{x,j) A unique{x) A love{p, x)], and 3x[ wof{x,p) A 
unique(x) A Zo?;e(p, a;)] for An(p). The first result yields the strict reading (Pe- 
ter loves Jon's wife) whereas the second yields the sloppy reading (Peter loves 
Peter's wife). 



5 Corrections and definiteness 

So far, we have only considered cases where the semantic representation of the 
destressed anaphor could syntactically unify with that of its antecedent. That is, 
in each case it was possible to find a substitution of values for free variables which 
made the two semantic representations a/^Ty-identical. In this section, we turn 
to more semantic cases, cases in which the relation between destressed anaphor 
and source parallel element is one of denotational - rather than syntactical - 
identity. Definites are a primary example of such a phenomenon: since one and 
the same individual can be referred to by several, distinct definite descriptions, 
it often happens that the definite description used in the destressed part of a 
correction is not structurally identical with the description used in its source 
parallel element. This is illustrated in example (^l]) where the source utterance 
contains the definite the woman with the red hat. As illustrated by ([l^a-d) , the 
parallel element in the correction can be his wife, her, the neighbour's daughter 
or Sarah. In each case, the description does not syntactically unify with the 
source description the woman with the red hat. Note however that the correction 
is only well-formed when the parallel descriptions are interpreted as referring to 
one and the same individual (cf. the ill-formedness of ([ll]e-g)). That is, when 
they are semantically equivalent. 

(11) Jo7^2 likes [the woman with the red hat]i 

a. No, PETER3 likes his wife NPi) 

b. No, PETER likes heri . 

c. No, PETER likes [the neighbour's daughterji. 

d. No, PETER likes Sarah. 

e. * No, PETER likes her^. 

f. * No, PETER likes Mary4. 

g. * No, PETER likes him. 

^^In what follows, we abbreviate XQ.3x[P{x) A Vj/[P{j/) ^ y = x] A Q{x)] to XQ.3x[P{x) A 
unique{x) A Q{x)]. 
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How does HOUE account for such examples? To show this, we now sketch the 
main steps of the unification process for example (Ulp) with equations: 



An{p) ~ like{p, s) 

^n-U) — 'ix{w{x) A wrh{x) A unique{x) A like{j, x)) 

These are solved in a context, where Sarah is the only woman with a red hat. 
The HOUE method is given access to the hypotheses unique{s), w{s) and wrh{s) 
by adding them to the initial tableau. In a first step, we solve the first equation 
to An = Xz.like(z, s) and obtain the following tableau: 

unique{s) 
w{s) 
wrh{s) 
An{p) = like{p, s) 

like(j, s) — 3x{w{x) A wrh{x) A unique{x) A like{j, x)) 

The HOUE rule discussed in section |^ now splits the initial equation into two 
branches. The first one has the form 



-iu'(z) 
* [z = s] 



like{j, s) 
■^3x{w(x) A ... A like{j, x)) 



^wrh{z) 

*[z = si 



-iunique{z) 

*[z = s\ 



-^like{j, z) 
*\z = s] 



and contains the formulae like{j, s) and {-3x{w{x) A wrh[x) A unique{x) A 
like{j, x))). The latter is universally quantifiedp] and can therefore be developed 
into four branches -'w{z), -i'wrh{z), -^unique{z) , and -ilike^j, z). The first three 
branches can be closed using the hypotheses on Sarah and the last one with the 
first formula, all by binding the new variable z to s. The second branch has the 
form 

-^like{i, s) 
3x{. . . A unique{x) A like{j, x)) 
unique{c) 
like{j, c)) 



c = s 
like{j, s) 
*[] 

and consists of the formulae ^like{j, s) and 3x{w{x) A wrh{x) A unique{x) A 
like{j,x)), which is developed into the single branch containing the conjuncts 
w{c), wrh{c), unique{c), and like{j,c)), where c is a Skolem constant for x. 
Here an expansion of the definition of uniqueness 

unique{x) <^ Vz(w(z) A wrh{z) ^ a; = z) 



We use that ^3x.A is equivalent to \/x.—<A here. 
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closes the branch (if Sarah and c are unique, then s = c). 

By now, it should be clear that our treatment will also encounter no particu- 
lar problem in dealing with examples such as (|lj) and (|l^) below. The first 
example relies on the world-knowledge that marrying is a symmetric relation 
(both partners have to say "yes I do"), whereas the second relies on the fact 
that getting wounded is synonymous to being hurt by someone/thing. Once 
these equivalences are taken into account, the HOUE analysis of corrections 
will correctly predict that these examples are well-formed. 

(12) a. A: Jon married Sarah 

h. B: No, Sarah married PETER 

(13) a. A: Sarah hurt Paul. 

b. B: No, PETER was wounded. 



We have seen that a deaccented anaphor must either have a semantic represen- 
tation which syntactically unifies with that of its antecedent, or be semantically 
equivalent to this antecedent. To show that this is a necessary condition, we 
need to provide some ill-formed examples in which neither condition holds. 
Such examples are given when the correction contains a destressed pronoun 
whose source parallel element is either an indefinite (^4|) or a quantifier (p^). 

(14) a. Jon eats ani apple. 

b. * No, PETER eats iti. 

(15) a. Jon kissed mosti women at the party yesterday, 
h. * No, PETER kissed themi . 



In both cases, the semantic representation of the pronoun in the correction 
fails to syntactically unify with the semantic representation of its antecedent. 
Neither can it be proved that it and them are semantically equivalent to an apple 
and most women at the party respectively. Therefore, unification fails correctly 
ruling out ( p^ and ([T^). The logical reason for this e.g. in ([l^), is that while the 
second equation An{p) — eat{p, y) can be solved to An = \x.eat{x, y) yielding 
the negated -^{eat{j, y) <^ 3x{ap{x) A eat(j, x)), this cannot be refuted 



"Example is in fact ambiguous between a specific reading of the indefinite an apn le 
and a non-specific one. In the first case, the indefinite denote uniquely so that it in (Mb) 
refers to this unique apple. Since it is denotationally equivalent with its antecedent, HOUE 
will succeed. In the second case, there is no unique apple salient in the context, hence it and 
an apple cannot be denotationally equivalent. Therefore HOUE fails. The above discussion 
focuses on this second possibility. 
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6 Conclusion 



In a sense, it would be much more natural to express the proposed analysis in 
a dynamic setting (cf. | Kam8lJ ). The data discussed in section ^ clearly shows 



that definite, indefinites and quantifiers behave differently wrt. corrections. The 
intuition is that whereas, a definite can bind a pronoun in the correction (cf. ex- 
ample |ll|), indefinites and quantifiers cannot (cf. examples |l^,|l5|). These are of 
course precisely the sort of facts dynamic semantics was designed to deal with: 
if we assume that the correctee-correction pair is semantically represented by 
a disjunction ($ V ^'), then a definite in the correctee will be able to bind an 
anaphor in the correction (because definites have global scope) whereas indef- 
inites and quantifiers won't (because traditionally disjunction is static and the 
discourse referents introduced by one disjunct are not accessible to the other 
disjunct). In this paper, we've shown that such facts could be modelled by 
means of HOUE on static semantic representations; it would be interesting to 
see how the analysis would transpose to a more dynamic setting. This how- 
ever must await the development of Higher-Order Unification for a dynamic 
lambda-calculus . 

Another question worth investigating is whether the interleaving of anaphora 



resolution and quantification proposed in [ DSP91 could account for the data 
considered here. The approach has the advantage that it does not resort to 
equivalences, thus permitting better computational properties. However, unless 
definites are treated in a special way, it is unlikely that the approach will be 
able to capture examples such as ( |ll|) where denotational equivalence, rather 
than strict unification, is required. 

Finally, an interesting issue concerns the relationship between HOUE and ac- 
commodation. A simple way to model accomodation would be to posit that, 
as theorem proving hits a dead-end, accomodation can be used to close off a 
branch: the accomodated fact is the fact needed to derive a contradiction and 
close off this tableau branch. Naturally, this idea is too simplistic in that some 
model must be defined which constrains accomodation. This we leave as an 
open research issue. 
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